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3 <110> APPLICANT: Darst, Seth A 

4 Zhang, Gongyi 

5 Campbell, ELizabeth 

6 Minakin , Leonid 

7 SeverinoV/ Konstantin 

9 <120> TITLE OF INVENTION: A CRYSTAL OF BACTERIAL CORE RNA POLYMERASE AND METHODS 
10 OF USE THEREOF 

12 <130> FILE REFERENCE: 600-1-258 

14 <140> CURRENT APPLICATION NUMBER: 09/782,714 

15 <141> CURRENT FILING DATE: 2001-02-13 

17 <150> PRIOR APPLICATION NUMBER: 09/396,651 

18 <151> PRIOR FILING DATE: 1999-09-15 
21 <160> NUMBER OF SEQ ID NOS : 4 
23 <170> SOFTWARE: Patentln Ver . 2.0 

25 <210> SEQ ID NO: 1 

26 <211> LENGTH: 1525 

27 <212> TYPE: PRT 

28 <213> ORGANISM: Thermus aquaticus 

30 <220> FEATURE: 

31 <221> NAME/KEY: SITE 

32 <222> LOCATION: (1247) 

33 <223> OTHER INFORMATION: Any amino acid can be at this position 

35 <400> SEQUENCE: 1 

36 Met Lys Lys 

37 1 

39 Lys lie Arg 
40 

42 Asn Tyr Arg 

43 35 

45 lie Phe Gly 

46 50 
4 8 Arg Gin Arg 
49 65 

51 Thr Arg Ser 
52 

54 Thr Pro Ala 
55 

57 Gly Thr Leu 

58 115 

60 Phe Asn Lys 

61 130 

63 Val Pro Val 

64 145 

66 Leu Arg Tyr 
67 

69 Ala Leu Val 
70 
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265 










270 






87 


Arg 


Gin 


Glu 


Glu 


Glu 


Val 


Val 


Ala 


Arg 


Tyr 


Phe 


Leu 


Pro 


Ala 


Gly 


Met 


88 






275 










280 










285 








90 


Thr 


Pro 


Leu 


Val 


Val 


Glu 


Gly 


Glu 


He 


Val 


Glu 


Val 


Gly 


Gin 


Pro 


Leu 


91 




290 










295 










300 










93 


Ala 


Glu 


Gly 


Lys 


Gly 


Leu 


Leu 


Arg 


Leu 


Pro 


Arg 


His 


Met 


Thr 


Ala 


Lys 


94 


305 










310 










315 










320 


96 


Glu 


Val 


Glu 


Ala 


Glu 


Glu 


Glu 


Gly 


Asp 


Ser 


Val 


His 


Leu 


Thr 


Leu 


Phe 


97 










325 










330 










335 




99 


Leu 


Glu 


Trp 


Thr 


Glu 


Pro 


Lys 


Asp 


Tyr 


Lys 


Val 


Ala 


Pro 


His 


Met 


Asn 


100 








340 










345 










350 






102 


Val 


He 


Val 


Pro 


Glu 


Gly 


Ala 


Lys 


val 


Gin 


Ala 


Gly 


Glu 


Lys 


He 


Val 


103 






355 










360 










365 








105 


Ala 


Ala 


He 


Asp 


Pro 


Glu 


Glu 


Glu 


Val 


He 


Ala 


Gin 


Ala 


Glu 


Gly 


Val 


106 




370 










375 










380 










108 


Val 


His 


Leu 


His 


Glu 


Pro 


Ala 


Ser 


He 


Leu 


Val 


Val 


Lys 


Ala 


Arg 


Val 


109 


385 










390 










395 










400 


111 


Tyr 


Pro 


Phe 


Glu 


Asp 


Asp 


Val 


Glu 


Val 


Thr 


Thr 


Gly 


Asp 


Arg 


Val 


Ala 


112 










405 










410 










415 




114 


Pro 


Gly 


Asp 


Val 


Leu 


Ala 


Asp 


Gly 


Gly 


Lys 


val 


Lys 


Ser 


Glu 


He 


Tyr 


115 








420 










425 










430 






117 


Gly 


Arg 


Val 


Glu 


Val 


Asp 


Leu 


Val 


Arg 


Asn 


Val 


Val 


Arg 


Val 


Val 


Glu 


118 






435 










440 










445 








120 


Ser 


Tyr 


Asp 


He 


Asp 


Ala 


Arg 


Met 


Gly 


Ala 


Glu 


Ala 


He 


Gin 


Glu 


Leu 


121 




450 










455 










460 










123 


Leu 


Lys 


Glu 


Leu 


Asp 


Leu 


Glu 


Lys 


Leu 


Glu 


Arg 


Glu 


Leu 


Leu 


Glu 


Glu 


124 


465 










470 










475 










480 


126 


Met 


Lys 


His 


Pro 


Ser 


Arg 


Ala 


Arg 


Arg 


Ala 


Lys 


Ala 


Arg 


Lys 


Arg 


Leu 


127 










485 










490 










495 




129 


Glu 


Val 


Val 


Arg 


Ala 


Phe 


Leu 


Asp 


Ser 


Gly 


Asn 


Arg 


Pro 


Glu 


Trp 


Met 


130 








500 










505 










510 






132 


He 


Leu 


Glu 


Ala 


Val 


Pro 


Val 


Leu 


Pro 


Pro 


Asp 


Leu 


Arg 


Pro 


Met 


Val 


133 






515 










520 










525 








135 


Gin 


Val 


Asp 


Gly 


Gly 


Arg 


Phe 


Ala 


Thr 


Ser 


Asp 


Leu 


Asn 


Asp 


Leu 


Tyr 


136 




530 










535 










540 










138 


Arg 


Arg 


Leu 


He 


Asn 


Arg 


Asn 


Asn 


Arg 


Leu 


Lys 


Lys 


Leu 


Leu 


Ala 


Gin 


139 


545 










550 










555 










560 


141 


Gly 


Ala 


Pro 


Glu 


He 


He 


He 


Arg 


Asn 


Glu 


Lys 


Arg 


Met 


Leu 


Gin 


Glu 


142 










565 










570 










575 




144 


Ala 


Val 


Asp 


Ala 


Val 


He 


Asp 


Asn 


Gly 


Arg 


Arg 


Gly 


Ser 


Pro 


Val 


Thr 



file://C:\CRF3\Outhold\VsrI782714.htm 



3/29/01 



Page 3 of 7 



RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/782,714 



DATE: 03/29/2001 
TIME: 09:59:32 



Input Set : N:\Crf3\RULE60\09782714.txt 
Output Set: N:\CRF3\03292001\I782714.raw 



14 5 



580 



585 



590 



14 7 Asn Pro Gly Ser Glu Arg Pro Leu Arg Ser Leu Thr Asp lie Leu Ser 
148 595 600 605 

150 Gly Lys Gin Gly Arg Phe Arg Gin Asn Leu Leu Gly Lys Arg Val Asp 

151 610 615 620 

153 Tyr Ser Gly Arg Ser Val lie Val Val Gly Pro Gin Leu Lys Leu His 

154 625 630 635 640 

156 Gin Cys Gly Leu Pro Lys Arg Met Ala Leu Glu Leu Phe Lys Pro Phe 

157 645 650 655 

159 Leu Leu Lys Lys Met Glu Glu Lys Ala Phe Ala Pro Asn Val Lys Ala 

160 660 665 670 

162 Ala Arg Arg Met Leu Glu Arg Gin Arg Asp lie Lys Asp Glu Val Trp 

163 675 680 685 

165 Asp Ala Leu Glu Glu Val lie His Gly Lys Val Val Leu Leu Asn Arg 

166 690 695 700 

168 Ala Pro Thr Leu His Arg Leu Gly lie Gin Ala Phe Gin Pro Val Leu 

169 705 710 715 720 

171 Val Glu Gly Gin Ser lie Gin Leu His Pro Leu val Cys Glu Ala Phe 

172 725 730 735 

174 Asn Ala Asp Phe Asp Gly Asp Gin Met Ala Val His Val Pro Leu Ser 

175 740 745 750 

177 Ser Phe Ala Gin Ala Glu Ala Arg lie Gin Met Leu Ser Ala His Asn 

178 755 760 765 

180 Leu Leu Ser Pro Ala Ser Gly Glu Pro Leu Ala Lys Pro Ser Arg Asp 

181 770 775 780 

183 lie He Leu Gly Leu Tyr Tyr He Thr Gin Val Arg Lys Glu Lys Lys 

184 785 790 795 800 

186 Gly Ala Gly Met Ala Phe Ala Thr Pro Glu Glu Ala Leu Ala Ala Tyr 

187 805 810 815 

189 Glu Arg Gly Glu val Ala Leu Asn Ala Pro He Val Val Ala Gly Arg 

190 820 825 830 

192 Glu Thr Ser Val Gly Arg Leu Lys Phe Val Phe Ala Asn Pro Asp Glu 

193 835 840 845 

195 Ala Leu Leu Ala Val Ala His Gly Leu Leu Asp Leu Gin Asp Val val 

196 850 " 855 860 

198 Thr Val Arg Tyr Leu Gly Arg Arg Leu Glu Thr Asn Pro Gly Arg He 

199 865 870 875 880 

201 Leu Phe Ala Arg He Val Gly Glu Ala Val Gly Asp Glu Lys Val Ala 

202 885 890 895 

204 Gin Glu Leu He Gin Met Asp Val Pro Gin Glu Lys Asn Ser Leu Lys 

205 900 905 910 

20 7 Asp Leu Val Tyr Gin Ala Phe Leu Arg Leu Gly Met Glu Lys Thr Ala 
208 915 920 925 

210 Arg Leu Leu Asp Ala Leu Lys Tyr Tyr Gly Phe Thr Leu Ser Thr Thr 

211 930 935 940 

213 Ser Gly He He Thr He Gly He Asp Asp Ala Val He Pro Glu Glu 

214 945 950 955 960 

216 Lys Gin Arg Tyr Leu Glu Glu Ala Asp Arg Lys Leu Arg Gin He Glu 

217 965 970 975 
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1365 



1370 



1375 



294 


Leu Lys Tyr 


Val 


Glu 


Val 


Thr 


Asp 


Pro 


Gly 


Asp 


Ser 


Pro 


Leu 


Leu 


Glu 


295 


1380 








1385 








1390 






297 


Gly Gin val 


Leu 


Glu 


Lys 


Trp 


Asp 


Val 


Glu 


Ala 


Leu 


Asn 


Glu 


Arg 


Leu 


298 


1395 








1400 








1405 








300 


He Ala Glu 


Gly 


Lys 


Val 


Pro 


Val 


Ala 


Trp 


Lys 


Pro 


Leu 


Leu 


Met 


Gly 


301 


1410 








1415 








1420 










303 


Val Thr Lys 


Ser 


Ala 


Leu 


Ser 


Thr 


Lys 


Ser 


Trp 


Leu 


Ser 


Ala 


Ala 


Ser 


304 


1425 




1430 








1435 








1440 


306 


Phe Gin Asn 


Thr 


Thr 


His 


Val 


Leu 


Thr 


Glu 


Ala 


Ala 


He 


Ala 


Gly 


Lys 


307 




1445 








1450 








1455 




309 


Lys Asp Glu 


Leu 


He 


Gly 


Leu 


Lys 


Glu 


Asn 


Val 


He 


Leu 


Gly 


Arg 


Leu 


310 


1460 








1465 








1470 






312 


He Pro Ala 


Gly 


Thr Gly 


Ser 


Asp 


Phe 


Val 


Arg 


Phe 


Thr 


Gin 


Val 


Val 


313 


1475 








1480 








1485 








315 


Asp Gin Arg 


Thr 


Leu 


Lys 


Ala 


He 


Glu 


Glu 


Ala 


Arg 


Lys 


Glu 


Ala 


Val 


316 


1490 






1495 








1500 










318 


Glu Ala Lys 


Glu 


Lys 


Glu 


Ala 


Pro 


Arg 


Arg 


Pro 


val 


Arg 


Arg 


Glu 


Gin 


319 


1505 




1510 








1515 








1520 


321 


Pro Gly Lys 


Gly 


Leu 
























322 




1525 
























325 


<210> SEQ ID NO 


: 2 
























326 


<211> LENGTH: 1119 
























327 


<212> TYPE: 


PRT 


























328 


<213> ORGANISM : 


Thermus 


aquaticus 
















330 


<220> FEATURE: 


























331 


<221> NAME/KEY: 


SITE 






















332 


<222> LOCATION: 


(695) . . 


(696) 


















333 


<223> OTHER 


INFORMATION 


: Any amino acids can be 


at these two positions 


335 


<400> SEQUENCE: 


2 
























336 


Met Lys He 


Lys 


Arg 
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Gly 


Arg 


He 


Arg 


Glu 


Val 


He 


Pro 


Leu 


Pro 


337 


1 
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Pro Leu Thr 
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He 
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Val 


Glu 
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Lys 


Ala 
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Gin 


Ala 


340 
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Asp Val Pro 


Pro 


Glu 


Lys 


Arg 


Glu 


Asn 


Val 


Gly 


He 


Gin 


Ala 


Ala 


Phe 


343 


35 










40 










45 








345 


Lys Glu Thr 


Phe 


Pro 


He 


Glu 


Glu 


Gly 


Asp 


Lys 


Gly 


Lys 


Gly 


Gly 


Leu 


346 


50 








55 










60 










348 


Val Leu Asp 


Phe 


Leu 


Glu 


Tyr 


Arg 


He 


Gly 


Asp 
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Pro 


Phe 


Ser 


Gin 


349 


65 






70 










75 
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Asp Glu Cys 


Arg 
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Asp 


Leu 


Thr 


Tyr 


Gin 


Ala 
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Leu 


Tyr 


Ala 


352 






85 










90 
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354 


Arg Leu Gin 


Leu 


He 


His 


Lys 


Asp 


Thr 


Gly 


Leu 


He 


Lys 


Glu 


Asp 


Glu 


355 




100 










105 










110 






357 


Val Phe Leu 


Gly 


His 


Leu 


Pro 


Leu 


Met 


Thr 


Glu 


Asp 


Gly 


Ser 


Phe 


He 


358 


115 










120 










125 








360 


He Asn Gly 


Ala 


Asp 


Arg 


Val 


He 


Val 


Ser 


Gin 


He 


His 


Arg 


Ser 


Pro 


361 


130 








135 










140 










363 


Gly Val Tyr 


Phe 


Thr 


Pro 


Asp 


Pro 


Ala 


Arg 


Pro 


Gly 


Arg 


Tyr 


He 


Ala 



file.7/C:\CRF3\OutholdWsrI7827 1 4.htm 



3/29/01 




Page 6 of 7 



VERIFICATION SUMMARY DATE: 03/29/2001 

PATENT APPLICATION: US/09/7 82,714 TIME: 09:59:33 

Input Set : N:\Cxf3\RULE60\09782714.txt 
Output Set: N:\CRF3\03292001\l782714.xaw 

L:267 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:1 
L:465 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:2 



file://C:\CRF3\Outhold\VsrI7827 1 4.htm 



3/29/01 



